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^ ■ Abstract 
(N 

5— ( I In this manuscript, we propose a novel 11 -point cyclic convolution algorithm based on alternate 

Qh. 

. Fourier transform. With the proposed bilinear form, we construct a length-2047 cyclotomic FFT. 

o 

^ ■ I. Introduction 

Discrete Fourier transforms (DFTs) over finite fields have widespread applications in error correction 
c/3 ! coding [1]. For Reed-Solomon (RS) codes, all syndrome-based bounded distance decoding methods 

involve DFTs over finite fields [1]: syndrome computation and the Chien search are both evaluations of 



> 



polynomials and hence can be viewed as DFTs; inverse DFTs are used to recover transmitted codewords 
in transform-domain decoders. Thus efficient DFT algorithms can be used to reduce the complexity 
of RS decoders. For example, using the prime-factor fast Fourier transform (FFT) in [2], Truong et 
■ al. proposed [3] an inverse-free transform-domain RS decoder with substantially lower complexity than 



(N 



OO ! time-domain decoders; FFT techniques are used to compute syndromes for time-domain decoders in [4]. 

o 

Cyclotomic FFT was proposed recently in [5] and two variations were subsequently considered in [6], 
^ , [7]. Compared with other FFT techniques [2], [8], CFFTs in [5]-[7] achieve significantly lower mul- 

' tiplicative complexities, which makes them very attractive. But their additive complexities (numbers of 

additions required) are very high if implemented directly. A common subexpression elimination (CSE) 
algorithm was proposed to significantly reduce the additive complexities of CFFTs in [9]. Along with 
those full CFFTs, reduced-complexity partial and dual partial CFFTs were used to design low complexity 
RS decoders in [10]. The lengths of CFFTs in [9] are only up to 1023 while longer CFFTs are required to 
decode long RS codes. To pursuit a length-2047 CFFT, 11 -point cyclic convolution over characteristic-2 
fields is necessary, which is not readily available in the literature. 

In this manuscript, we first propose a novel 11 -point cyclic convolution for characteristic-2 fields in 
Section HIl Based on this cyclic convolution, a length-2047 CFFT is presented in Section Hill Using the 
same approach, CFFTs of any lengths that divide 2047 can also be constructed. 
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II. 11 -Point Cyclic Convolution over Characteristic-2 Fields 



We first derive a fast cyclic convolution of 1 1 points over the real field. Denote the cyclic convolution 
of 11 point sequences x and y by the sequence zQ The sequence z may be computed by Fourier 
transforming x and y, multiplying the transforms point-by-point and finally, inverse Fourier transforming 
the product sequence. Let X, Y and Z denote the Fourier transforms of x, y and z respectively. As 
defined by the Fourier transform, 



10 

i=0 



10 

i=0 



(1) 



We express the rest components, X' and Y' (over reals) using the basis (1, W, W'^, . . . , W^) where W 
denotes the 11th primitive root of unity. This basis is sufficient because W^^ — 1 = yields W^^ = 



W^. Thus, X' = ^^^0 X^W^ and Y' = J^Lo ^/^*' which 
X'- = {xi - xio) Y- = {yi - yio). 



(2) 



We will call the vector (Xq, Xq, X[, . . . ,Xg) as the Alternate Fourier transform (AFT) of sequence 
X. Note that AFT is simply the DFT components Xq and X' in their special bases. From ([T|) and ^ 
it is obvious that the AFT computation may be described as a multiplication with a 11 x 11 matrix B 
with structure 



B 



1 1 



^10 



1 1 
-1 
-1 

-1 

where /lo is a 10 x 10 identity matrix. Alternately, given the AFT of x, one can determine x by using 
matrix B~^ given by 



1 



B ' = - 
11 



1 Ai 
A2 A3 

where length-10 row Ai = (10, —1, —1, . . . , —1), length-10 column A2 = (1, 1, 
submatrix A3 has 10 on the first upper diagonal and -1 everywhere else. 
Now consider the product of B^^ and an AFT vector: 



(3) 

l)"^ and 10 X 10 



B 



-1 





1 


1 Ai 




Uo 




Vo 


U' 


~ 11 


A2 A3 




U' 




V 



'in this manuscript, vectors and matrices are represented by boldface letters, and scalars by normal letters. 
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where Uq, U', and Vq, V' are appropriate partitions of the AFT and the signal vectors. Values of Vq 
and V can be computed as Vq = (l/ll)C/o + (l/ll)Ai?7' and V = {l/ll) A2UQ + {I /II) A^U' . Note 
that Ai and A3 are related as Ai = —(1, 1, . . . , 1) A3. This implies that the sum of the components of 
(1/11)A3?7' gives —{\/\l)AiU'. Furthermore, A2 contains only I's. Thus the computation of Vq and 
V' reduces to 

Fo = (l/ll)t/o-(l/ll)V(A3C/') 

(4) 

V = (l/ll)[C/o, f/o, . . . , C/o]^ + (1/11)A3J7'. 
Relation ^ shows that the inverse of an AFT only needs an evaluation of {1/11) A^U' . 

To compute cyclic convolution of x and y, one should multiply the Fourier transforms of x and y and 
then take the inverse Fourier transform of the product. We use AFT instead of classical Fourier transform. 
Multiplying Xq and Yq is simple, but since X' and Y' are expressed in a basis with 10 elements, their 
product may be difficult. Similarly inverse AFT requires multiplication by matrix A3 which may be 
complicated. However, we now show that both these two difficult computation stages are equivalent to 
only a Toeplitz product (i.e., product of a Toeplitz matrix and a vector) [11]. 

The pointwise multiplication results are Zq = XqYq and Z' defined as 

^X.'VFM r£YlW^ =Y.^[W\ (5) 

Vector Z' can be computed through the matrix product (Zq, Z[, . . . , Zg)"^ = M{Xq, X[, . . . , Xg)'^ where 
the elements of matrix M are 

Mk,j = n'-, + - ^iVr (6) 

Note that in Y- are considered as zero outside its valid range, i.e., y/ = if i < or i > 9. The 
terms in Q are easy to deduce from ([5]l. Matrix element Mj^j sums up those terms in Y' that after 
multiplication with X'jW^ result in W'^ terms. For example, since product of XjW^ and Yl_-W^~^ 
results in X'-Yl,_-W^ , we get the first term in Q as given. Second term of Q can be similarly argued. 
The third term is due to the product {X'-W^){Y;^_jW^^-^) = X'-YIq_jW^^ = -X'-Y{q_- ^^^q W\ 

Computing inverse DFT of Z requires one to multiply A3 and vector (Zq, ZJ, . . . , Zg)^ where A3 is 
the matrix defined in ([3]). Thus one has to compute i?(Xi(0), . . . , X'^)^ where the 10 x 10 matrix 

R = (1/11)A3M. 
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We now show by direct computation that i? is a ToepUtz matrix. From the structure of A3, we have 

9 

I 

11 

9 



1 10 

1 ^ 



(7) 



k=0 



From ((61), using the appropriate ranges for the three terms we now get 

k=0 k=0 k=0 k=j 

9 9-i 

= -ioyi'o_,.+ E ^/ + E^^' 

s=ll-j s=0 

9 

= 5^y;-iii^o-i 

Finally, combining ([6]), ([Tl) and ([8]) gives 

Ri 



(8) 



11 



(9) 



s=0 



Since -Rj^j is a function of only i — j, R is a ToepUtz matrix. Thus Z' = RX' is computed as 

Y^9 Y' V^9 V' 

E9 -v^/ v^lO T/- 
i=0 Z^i=l 
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^9 • 
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Yl 


Y' 
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Y^9 y/ v^9 -1^/ 

Recall that is assumed zero if its index is outside the valid range from to 9. Thus in exactly 
one of the first two terms is valid for any combination of i and j. Fig. [T] illustrates the bilinear cyclic 
convolution algorithm of length 1 1 based on this discussion. 

The multiplication of the 10 x 10 Toeplitz matrix R with a vector can be obtained using the ToepUtz 
product algorithms of lengths 2 and 5. The matrix R can be split into four 5x5 submatrices and the 
vector X' can be split into two length-5 vectors. By the definition of Toeplitz matrices, the Toeplitz 
product RX' can be computed as 

RX' 

Although the cyclic convolution is derived over tUe real field, it can be easily converted to cUaracteristic- 
2 fields. Based on the method in [12], we multiply both sides of all equations above by 11 modulus 2. In 
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+ (i^2 - Ro)x'q 
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Fig. 1. 11 -point cyclic convolution based on AFT 



the converted form, X = Tx, Y = Ty, and z = SZ. Thus we obtain 11-point cycUc convolution over 
characteristic-2 fields. To find its bilinear form, we need the bilinear form of ToepUtz product of length 10. 
The bilinear form of length-5 Toeplitz product over characteristic-2 fields v = Q^'^^^R^'^^^ r ■ P'-'^^^u) 
is given in Appendix IH where • stands for pointwise multiplication. 

Based on the length-10 Toeplitz product, the bilinear form of 1 1-point cyclic convolution over GF(2™) 
is given by 



10 

Q(^5) g(T5) 

q(^5) q(^^) 



1 

R^^^^Uo 

i?(^5)ni 

i?(^^)n2 



Ty 



1 

p(^5)n3 

p(^5)n4 

p(^5)n5 



Tx 



Details of matrices S, T, Uq, . . . , lis are given in Appendix JIl 

The proposed length- 11 cyclic convolution needs only 43 multiplications. We compare it with cyclic 
convolutions of other lengths from [1], [13], [14] in Table J] 

III. Cyclotomic FFT over GF(2ii) 

Based on the derived 11-point cyclic convolution over GF(2™), we can construct a length-2047 
cyclotomic FFT over GF(2^^). In this manuscript, we focus on direct CFFT as in [5] since it was 
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TABLE I 

Multiplicative Complexity of Cyclic Convolution 



n 


2 


3 
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5 
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10 


11 


Mult. 
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9 


10 


12 


13 


27 


19 


30 


43 



shown in [9] all variants of CFFTs have the same multiplicative complexity and they have the same 
additive complexity under direct implementation. 

Given a primitive element a G GF(2"*), the DFT of a vector / = (/o, /i, . . . , fn-i)^ is defined as 

F ^ (/(aO), /(ai), . . . , f{a-^)f, where f{x) ^ ^^=0 fi^' ^ GF(2-)[x]. 

We choose the field generated by the polynomial x^^+x^+l.ln this field, there are one size- 1 coset and 
186 size-11 cosets. We permute the input / to /' such that /' = (/o, fi, /2, • • • , /ise) ^^^^ size-11 
vector /j contains the components of / whose indices are in the same coset (fcj, fcj2, . . . , fcj2™*^^) mod 
2047 where | 11 is the coset size. Thus the polynomial f{x) is divided into parts, each one is Li{x^^) = 
Yl^=o^ fki2i mod 2047 (s;'^')^'- Hcncc Li{x)'s are linearized polynomials. Each element a'^' can be decom- 
posed with respect to a basis /3j = (A,o, • • • , A,mi-i) such that a^''' = Y.l=Q(^i,j,sPi,s,ai,j,s e 
GF(2). So each component of DFT is factored into 

186 186 m, 186 10 10 

1=0 i=Q s=0 1=0 s=0 p=0 

In matrix form, it is F = aLf, in which L is a block diagonal matrix with each diagonal block being 

92 



A,o 
/3i,i 



A,m<-1 0i,mi-l ■ ■ ■ A,mi-1. 

Using a normal basis as the matrix Lj becomes a cychc matrix and i-i/j becomes a size-mj cyclic 
convolution. For length-2047 CFFT, is 1 or 1 1 . Thus we obtain a length-2047 CFFT using the bihnear 
form of 11 -point cycKc convolution as 



1 
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F = a 
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(11) 
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It requires 7812 multiplications to compute the constructed length-2047 CFFT. Under direct implemen- 
tation, it requires 2154428 additions. With incomplete optimization using the CSE algorithm [9], its 
additive complexity can be reduced to 529720. We compare its complexity with those of shorter CFFTs 
in Table HIl In Table HIl our numbers of additions for CFFTs of lengths 7, 15, • • • , 1023 are reproduced 
from [9]. 

TABLE II 

Complexity of Full Cyclotomic FFT 
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Mult. 
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Appendix I 
TOEPLiTZ Product of Length 5 

Toeplitz product of length 5 as 
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can be done in bilinear form as i> = Q^^^\R^'^^^r ■ P^'^^^u). 
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Appendix II 

Bilinear Form of 11-point Cyclic Convolution over Characteristic-2 Fields 
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